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A MARKOV MODEL OF A LIMIT ORDER BOOK: THRESHOLDS, RECURRENCE, 

AND TRADING STRATEGIES 

FRANK KELLY AND ELENA YUDOVINA 


Abstract. We analyze a tractable model of a limit order book on short time scales, where the dynamics 
are driven by stochastic fluctuations between supply and demand. We establish the existence of a limiting 
distribution for the highest bid, and for the lowest ask, where the limiting distributions are confined between 
two thresholds. We make extensive use of fluid limits in order to establish recurrence properties of the model. 
We use the model to analyze various high-frequency trading strategies, and comment on the Nash equilibria 
that emerge between high-frequency traders when a market in continuous time is replaced by frequent batch 
auctions. 


1. Introduction. 

A limit order book (LOB) is a trading mechanism for a single-commodity market. The mechanism is of 
significant interest to economists as a model of price formation. It is also used in many financial markets, 
and has generated extensive research, both empirical and theoretical: for a recent survey, see m- 

The detailed historic data from LOBs in financial markets has encouraged models able to replicate the 
observed statistical properties of these markets. Unfortunately, the added complexity usually makes the 
models less analytically tractable and, with relatively few exceptions, such models are explored by simulation 
or numerical methods. Our aim in this paper is to analyze a simple and tractable model of a LOB, first 
introduced by [22] and independently by m and by [19] . The basic form of the model explicitly excludes a 
number of significant features of real-world markets. Nevertheless we shall see that, from the model, several 
non-trivial and insightful results can be obtained on the structure of high-frequency trading strategies. 
Further, the model has a natural interpretation for a competitive and highly traded market on short time- 
scales, where the excluded features may be less significant. We believe the model may be helpful in discussions 
of market design, and as an illustration we use the model to comment on the Nash equilibria that emerge 
between high-frequency traders when a market in continuous time is replaced by frequent batch auctions. 

To motivate the model consider a market with only two classes of participant. Firstly, long-term investors 
who place orders for reasons exogenous to the modelQ who view the market as effectively efficient for 
their purposes, and who do not shade their orders strategically. Temporary imbalances between supply 
and demand from such long-term investors will cause prices to fluctuate even in the absence of any new 
information becoming available concerning the fundamentals of the underlying asset. Our second class of 
participant, high-frequency traders, attempt to benefit from these price fluctuations by providing liquidity 
between the long-term investors. In practice we should expect a spectrum of behavior between these two 
extremes. The extreme case, with just long-term investors and high-frequency traders, is clearly a caricature, 
but we shall see that it does allow us to analyze various high-frequency trading strategies (for example 
market-making, sniping and mixtures of these) and the Nash equilibria between them. 

We next describe the model of a LOB for an example involving long-term investors only, and outline our 
results for this example. A bid is an order to buy one unit, and an ask is an order to sell one unit. Each 
order has associated with it a price, a real number. Suppose that bids and asks arrive as independent Poisson 
processes of unit rate and that the prices associated with bids, respectively asks, are independent identically 
distributed random variables with density fb{x), respectively faix). An arriving bid is either added to the 
LOB, if it is lower than all asks present in the LOB, or it is matched to the lowest ask and both depart. 
Similarly an arriving ask is either added to the LOB, if it is higher than all bids present in the LOB, or it is 
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^For example, to manage their portfolios. Investors may differ in their preferences and in their valuations, even given the 
same information, which creates potential gains from trade. 
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matched to the highest bid and both depart. The LOB at time t is thus the set of bids and asks (with their 
prices), and our assumptions imply the LOB is a Markov process. 

For this model we show that there exists a threshold Kb with the following properties: for any x < Kb 
there is a finite time after which no arriving bids less than x are ever matched; and for any x > Kb the 
event that there are no bids greater than x in the LOB is recurrent. Similarly, with directions of inequality 
reversed, there exists a corresponding threshold Ka for asks. Further there is a density 7ra(a;), respectively 
TTb{x), supported on (Kb, Ka) giving the limiting distribution of the lowest ask, respectively highest bid, in 
the LOB. The densities i^aiT^b solve the equations 


(la) 

fb{x) 

/ 'Xa{y)dy 

lx 

(lb) 

fa{x) 

/ 'Xb{y)dy ■■ 
J Kb 


As a specific example, if fa{x) = fb{x) = l,x G (0,1), then Ka = k, Kb = 1 — k, 7ra(x) = 7rb(l — x), and 


( 2 ) 




X G (k,1 — k) 


where the value of k is given as follows. Let w be the unique solution of we^ = e“^: then w ~ 0.278 and 
K = w/{w + 1) « 0.218. Observe that any example with fa = fb can be reduced to this example by a 
monotone transformation of the price axis. 

The existence of thresholds with the claimed properties is a relatively straightforward result, using Kol¬ 
mogorov’s 0-1 law. In order to make the claimed distributional result precise the major challenge is to 
establish positive recurrence of certain binned models: such models arise naturally where, for example, 
prices are recorded to only a finite number of decimal places. Given a sufficiently strong notion of recurrence 
the intuition behind equations Q is straightforward: in equilibrium the right-hand side of equation (la) is 
the probability flux that the highest bid in the LOB is at x and that it is matched by an arriving ask with a 
price less than x, and the left-hand side is the probability flux that the lowest ask in the LOB is more than 
X and that an arriving bid enters the LOB at price x] these must balance, and a similar argument for the 
lowest ask leads to equation (lb). To establish positive recurrence of the binned models we make extensive 
use of fluid limits (see [2]), an important technique in the study of queueing networks. 

The orders we have described so far are called limit orders to distinguish them from market orders which 
request to be fulfilled immediately at the best available price. Market orders are straightforward to include 
in the model: in the specific example just described we simply associate a price 1 or 0 with a market bid or 
market ask respectively. As the proportion of market bids increases towards a critical threshold, w « 0.278 in 
the above example, the support of the limiting distributions tTq , iTb increases to approach the entire interval 
(0,1): above the threshold the model predicts recurring periods of time when there will exist either no 
highest bid or no lowest ask in the LOB. This conclusion necessarily holds, with the same critical threshold 
w, for any example with fa = fb- 

A LOB is a form of two-sided queue, the study of which dates at least to the early paper of m, who 
modeled a taxi-stand with arrivals of both taxis and travellers as a symmetric random walk. Recent the¬ 
oretical advances involve servers and customers with varying types and constraints on feasible matchings 
between servers and customers, with applications ranging from large-scale call centres to national waiting 
lists for organ transplants (cf. [TJ [53J |2I] ). Our interest in models of LOBs is in part due to the simplicity 
of the matchings in this particular application: types, as real variables, are totally ordered and so when an 
arriving order can be matched the match is uniquely defined. 

Next we comment on several important features of real-world markets that are missing from the above 
basic model of a LOB. We assume that orders (from investors) are never cancelled and that the arrival 
streams of orders, with their prices, are not dependent on the state of the LOB. These assumptions might 
be natural for orders from our long-term investors who view the market as effectively efficient for their 
purposes. These assumptions, and the related assumption of stationarity of the arrival streams, may also 
be natural for a high-volume market where there may be a substantial amount of trading activity even over 
time periods where no new information becomes available concerning the fundamentals of the underlying 
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asset. Mathematically the model may then be viewed as assuming a separation between the time-scale of 
trading, represented in the model, and a longer time-scale on which fundamentals change. 

The assumption that all orders are for a single unit is important mathematically for the derivation of 
equations 0 ; economically, it corresponds to an assumption of a competitive market where an investor does 
not need to think about the impact of her order size on the market. We note that a long-term investor 
placing a large order may attempt to be passive in her execution, so as not to move the price against her, by 
spreading the order in line with volume in the market; see [5]. The natural question then becomes over how 
long the order is spread, and the model can give insight here. We note, however, that our assumption of a 
separation between the time-scale of trading and the timescale on which fundamentals change, modeled by 
our assumption of stationarity of arrival streams, may no longer be tenable when the time taken by a large 
investor to complete an order increases. In markets with a relatively small set of participants with large 
orders other approaches may be necessary; see [7] for a discussion of trading protocols that complement limit 
order books for large strategic investors. 

Markets may contain traders other than long-term investors, and there is currently considerable interest 
in the effect of high-frequency trading on LOBs. Importantly, many high-frequency trading strategies are 
straightforward to represent within the model, since traders who can react immediately to an order entering 
the LOB may leave the Markov structure intact. Consider first the following sniping strategy for a single 
high-frequency trader: she immediately buys every bid that joins the LOB at price above q and every ask 
that joins the LOB at price below p, where p and q are chosen to balance the rates of these purchases. This 
model fits straightforwardly within our framework, and we show how to calculate the optimal values, for the 
high-frequency trader, of the constants p and q. A single trader might instead behave as a market maker 
and place an infinite number of bid, respectively ask, orders at p, respectively g, where Hb < P < q < Ka- We 
are again able to analyze this case. The optimal profit rate under the sniping strategy may beat that under 
the market making strategy: it does so for the specihc example above where /a(a:) = fb{x) = l,a: € (0,1), 
describes the order flow from long-term investors. But a third strategy, which combines market making and 
sniping, will generally beat both the individual strategies. 

The model also allows us to readily explore the equilibria that emerge when there are multiple high- 
frequency traders competing using market making or sniping strategies. There has been considerable discus¬ 
sion recently of the effects of competition between multiple high-frequency traders, and of proposals aimed 
to slow down markets. A key issue is that high-frequency traders may wastefully compete on the speed with 
which they can snipe an order, and as a regulatory response Budish et al. mm propose replacing a market 
continuous in time with frequent batch auctions, held perhaps several times a second. We consider Nash 
equilibria in continuous and batch markets when there are multiple high-frequency traders competing using 
mixtures of market making and sniping. Competition between market making traders reduces the bid-ask 
spread and the traders’ profit rate, and does so whether the market is continuous or batch. Competition 
between sniping traders in a batch market results in a Nash equilibrium with traders sniping bids above, 
respectively asks below, a central price; the traders’ profit rate is slightly less in a batch market than a 
continuous market. 

Competition between sniping strategies produces a large number of cancelled orders since if a strategy’s 
attempt to snipe an arriving order is not successful then the strategy immediately cancels its own order. A 
notable feature of data from real LOBs, that a substantial proportion of orders are immediately cancelled m, 
thus emerges as a deduction from, rather than an assumption of, the model. 

A discrete version of the model was first proposed by |22j in his pioneering work on regulation of secu¬ 
rities markets, and the model was independently introduced by m and by m- Taking stationarity as an 
assumption, [16] provided an extensive analysis of the model; our equations Q can be deduced from [T6l 
Proposition 1], assuming steady-state behavior, by setting time derivatives to zero. Our contribution is to 
establish the existence of the thresholds Ka,Kb and to prove a sufficiently strong notion of recurrence to 
justify the intuition behind equations 0 . 

Previous research similar in mathematical framework to that reported here is by Cont and coauthors PIS], 
by Simatos and coauthors Enin] building on Lakner et al. m. and by Toke ESj: as we do, these authors 
describe LOBs as Markovian systems of interacting queues and are able to obtain analytical expressions 
for various quantities of interest. In the models of P P dH ES Ej the arrival rates of orders at any 
given price depends on how far the price is above or below the current best ask or bid price; the models 
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of [13 mj 1131 US] are one-sided in that all bids are limit orders and all asks are market orders. Gao et 
ai. uni study the temporal evolution of the the shape of a LOB in the model of |S], under a scaling limit. 
Maglaras et al. m study a fragmented one-sided market in which traders may route their orders to one of 
several exchanges. The work of Lachappelle et al. (TS], building on Ro§u [5D], uses a different mathematical 
framework, that of a mean field game, but shares with our approach some important features. In particular, 
these authors distinguish between institutional investors whose decisions are independent of the immediate 
state of the LOB and high-frequency traders who trade as a consequence of the immediate state of the LOB. 
The models of both [5] and [13] keep detailed information on queue sizes only at the best bid and best ask 
prices; [6] shares with our approach a Markov description of the entire LOB. 

In much of the market microstructure literature features of LOBs, such as large bid-ask spreads, are 
explained as a consequence of participants protecting themselves from others with superior information. 
While this is clearly an important aspect of real-world markets we note that such features may also arise 
from simpler models. The driving force for the dynamics of the LOB in our approach, as in [T3l|20|, is not 
asymmetric information but stochastic fluctuations between supply and demand. 

The organisation of the paper is as follows. In Section [^ we describe precisely the model and our main 
results. Sect ion [^develops the scaffolding necessary for the proofs, which are given in Section]^ In Section[^ 
we describe some applications of our results: this section contains our discussion of market orders, and of 
high-frequency trading strategies and Nash equilibria. 

2. Model and results. 

The state of the LOB at time t is a pair of (possibly infinite) counting measures on K; Bt 

represents the prices of queued (not yet executed) bid orders, and At represents the prices of queued asks. 
New orders arrive as a labeled point process; the label records the type of order (bid or ask) and the price. 
Without loss of generality, we assume that the price axis has been continuously reparametrized so that all 
prices fall in the interval (0,1) (or, occasionally, [0,1]). 

Orders depart from the queue when an arriving order “matches” one of the orders already in the book. 
We shall need several notions of what it means for two prices to match, and to capture this we introduce a 
price equivalence function, that is a nondecreasing, not necessarily continuous, function V : [0,1] —>■ [0,1]. 
A bid-ask pair is compatible if 7^(bid) > 7^(ask) .We shall primarily consider two types of price equivalence 
function: V^x) = x, and the function that partitions all prices into n pricing bins. We will refer to the latter 
case, where the image of 7^ is a finite set, as the binned model. Note that the same price equivalence function 
is applied to the prices of all the orders, and compatibility of bid-ask pairs is unchanged under any strictly 
increasing transformation of the equivalence function. 

We are now ready to formally define the evolution limit order book Ct- 
Initial state: Initially, there should be no compatible bid-ask pairs in the book. Equivalently, the initial 
state {Bo,Ao) satisfies 

Bo[x,l)-Aoi0,y]=0 if V{y) <V{x). 

Most of the time we assume that the total number of orders in the book is finite; we relax this assumption 
in Section [^ where we allow an infinite number of orders to be placed at a single price, and otherwise the 
book is finite. 

Order arrival process: New orders arrive as a Poisson process with iid labels designating the type and 
price of the order. Unless specified otherwise, we assume that P(bid) = P(ask) = 1/20 We assume the labels 
of orders are independent and identically distributed, and in particular independent of the state of the book, 
but the distributions of prices may depend on type. We let Fa be the CDF of prices of arriving asks, and Ft, 
be the CDF of prices of arriving bids. We will often assume that the distributions of the prices of arriving 
orders have densities fa and /{, respectively; this entails no loss of generality, because the LOB evolution is 
defined by the combination of the arriving price distributions and a price equivalence function, and thus we 
can always assume that the arriving orders have densities and only become discontinuous after being put 
through the price equivalence function. 


^The Poisson structure is not important to the book, becau se all that matters is the sequence of order arrivals. Unequal 
rates of arrival for bids and asks are considered in Section 5.1.1 
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Change at order arrival: We do not allow cancellations in the model (until Section]^, so all changes to the 
state occur at the time of an order arrival. Suppose at time t a bid at price p arrives. If there is a matching 
ask in the book, i.e. if At-{0,y] > 0 for some y such that V{y) < Vi^x), then nothing happens to the bids 
in the book {Bt = Bt_), and the lowest ask departs: At = At- — 6q, where q = min{a; : At-{x} > 0}[^ . If 
there are no matching asks in the book, the bid joins the book: Bt = Bt- + 5p and At = At-. The situation 
is symmetric if the arriving order is an ask at price q: if there is a matching bid, the two orders depart (so 
At = At- and Bt = Bt- — dp where p = max{a; : Bt-{x} > 0}), and if there are no matching bids, then the 
ask joins the book {Bt = Bt- and At = At- + 6q). 


We will be keeping track of the highest (price of a) bid /3t and lowest (price of an) ask at in the book at 
time t. If an order departs the book at time t, it must be at price j3t- (if a bid) or at- (if an ask). We allow 
Bq{x} = oo or Ao{y} = oo; if this is the case, then no bids left of x, and no asks right of y, will ever depart 
the limit order book, since they will never be the highest bid (respectively lowest ask). 

Below, we will refer to continuous and discretized models of LOBs. A continuous LOB is one where the 
order price densities fa and /{, (exist and) are bounded above and below, and the price equivalence function 
is V{x) = X. Discretized models will use some binned price equivalence function, and will sometimes (but 
not always) assume that all bins receive a positive proportion of the orders of each type. 

For a discretized, binned LOB, we will use notation |a;] to denote the index of the bin containing x\ |a;] 
is a positive integer ranging from I to for some N > 0. 

We now present the main results concerning the model. The first result. Theorem |2.1[ establishes a 
transition at threshold values and Eventually bids arriving below kj,, and asks arriving above Ka, will 
never be executed; whereas all bids arriving above Hh, and all asks arriving below Ka, will be executed. The 
second result, Theorem ] 2.2 [ presents the distribution of the rightmost bid and leftmost ask. 

Theorem 2.1 (Thresholds). There exist prices Kb and Ka with the following properties: 

(1) For any e > 0 there exists, almost surely, a (random) time Tq < oo such that fit > Kb — e and 
at < Ka + e for all t >To. 

(2) For any e > 0, infinitely often there will be no orders with prices in {Kb + e, Ka — e). 

(3) Let X > Kb + e and y < Ka — e for some e > 0. Consider the LOB started with infinitely many bids 
at X, infinitely many asks at y, and finitely many orders in between. The evolution of the orders at 
priees in the interval {x,y) is a positive (Harris) recurrent Markov process, with finite expected time 
until there are no orders in the interval. 


The fact that there will infinitely often be no bids above x, and no asks below y, is a consequence of 
Kolmogorov’s 0-1 law; the challenge is to show that there will simultaneously be neither bids nor asks in the 
interval {x,y). In fact, we shall need to prove this part of Theorem 2.1 and Theorem 2.2 below in tandem. 


Theorem 2.2 (Distribution of the highest bid). Consider a continuous LOB; that is, V{x) = x, and the 
densities fb and fa are bounded above and below. Then 

(1) The limiting distributions of the highest bid and lowest ask have densities, denoted tt^ and tTq; let 

tUb — '^b/fb and VJa — r^affa- 

(2) The thresholds satisfy 0 < Kb < Ka < i, and also Fb{Kb) = 1 — Fa{Ka). 

(3) The distribution of the highest bid is such that Wb is the unique solution to the ordinary differential 
equation 

(~ 1 -Fb{x) =Wb{x)fb{x) 

with initial conditions 


{Fa{x)Wb{x))\x=n,, = 1, {Fa{x)Wb{x))'\a:=Ki = 0 

and the additional constraint Wb{x) —>■ 0 as x ^ Ka. The distribution of the lowest ask is determined 
by a similar ODE. 


^The minimum exists when the initial state of the book is finite, since only finitely many orders are present in the book. 
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Asymptotic bid densities 



Figure 1. Limiting density of the highest bid for the binned LOB with 50 bins, and limiting 
density for a continuous LOB (dotted line). Note the “shoulder” bin in the binned model: 
the threshold in the continuous LOB lies in the interior of this bin. 


Corollary 2.3 (Uniform arrivals). Suppose that V{x) = x and the arrival price distribution is uniform on 
(0,1) for both bids and asks. Then Kb = k ^ 0.218 is given by k = w/(w + l) where we'" = e~^. The limiting 
density of the highest bid is supported on (k, 1 — k), and is given by 


Wb[x) = 1(«,i-k)(1 - k) Q +log 


and the limiting density of the lowest ask is vJaix) = vJb(l — x). 

Remark 1 (Absolute continuity). We can replace conditions on the densities fa and fb by the requirement 
that dFa/dFb be bounded above and below; however, it is more natural to state the result of Theorem 2.2 


in terms of densities. The boundedness requirement avoids the trivial counterexamples fb = 21[q x/ 2 )! 
fa = 21 ( 1 / 2 , 1 ] (nonoverlapping supports, no orders leave) or fa = 21[q 1 / 2 ), fb = 21 (i/ 2 ^i] (nonoverlapping 
supports, no threshold). 

Through a reparametrization of the price axis. Corollary |2.3| covers all cases where arriving bid and ask 
prices have identical densities. We describe some other analytically tractable applications of Theorem |2.2| in 
Section We shall also, in Section extend the analysis to deal with some examples where the supports 
of the bid and ask price distributions do not coincide. 


Remark 2. The form of the limiting density appearing in Corollary |2.3| can be deduced from equations 
(63)-(64) of [ini Section 3] after applying a coordinate transformation to convert between [0,oo) and [0,1). 

In Figure we show the exact limiting distribution of the highest bid for the binned LOB with uniform 
arrivals over 50 bins, along with the limiting distribution for the continuous LOB. Note the “shoulder” bin: 
in the binned LOB, the threshold happens to fall into the middle of a bin, so the long-term probability of 
having the rightmost bid in the bin is positive but below the continuous limit. 

While we have been able to compute analytically the distribution of the location of the rightmost bid, 
there are many related quantities for which we do not have exact expressions in steady-state (although 
the positive recurrence established in Theorem |2.1| implies that they are well-defined and can be estimated 


consistently from simulation). Notably, except in the special case to be considered in Section 5.1.3 we have 


not been able to derive analytic expressions for the equilibrium height of the book (i.e. expected number of 
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bids or asks at a given price in the binned model), or for the joint distribution of the highest bid and lowest 
ask. For an illustration of the simulated joint density of the highest bid and lowest ask, see [^ . 

2.1. Brief summary of notation. We summarize here our notation, as well as some of the main assump¬ 
tions used in the text. 

£: limit order book. 

V'. price equivalence function, a monotone increasing function. Most of the time we use either V{x) = x ov 
the function that places all prices into one of several bins. 

At, Bt'. the counting measure of asks, respectively bids, at time t. 

Fa, Ff,: CDFs of the prices of arriving ask and bid orders. Until Section [5.1.H newly arriving orders are 
assumed to have equal probability of being a bid or an ask. In a binned model, we may write Fa^b{n) (with 
n an integer) to refer to the fraction of orders arriving into bins with index < n, i.e. the CDF evaluated at 
the rightmost endpoint of the interval. 

fa, fb- the corresponding densities, which are assumed to exist. For most results, fa and fb are assumed to 
be bounded above and below. 

at, Pt' the price of the lowest ask, respectively highest bid, at time t. Note that this is the actual price, not 
the bin containing it. 

|a;]: in a binned LOB, the index of the bin containing price x. 

Ka, Kb- limiting prices above (respectively below) which only finitely many asks (bids) are ever executed. It 
is not obvious a priori that Ka < 1 or Kb > 0; we prove this fact in Step 3 of the proof of Proposition |4.2[ 
For functions of two or more arguments, we may interchange arguments and subscripts: thus, fk,n(t) = 
fn{k,t) = f{k,n,t). We will use notation /„(fc, •) when we wish to consider / as a function of the third 
argument alone. 


3. Preliminary results: monotonicity. 

Before proving the main results, we erect some scaffolding. Part of its purpose is to allow us to transition 
between continuous LOBs (for which we expect to get differential equations in the answer) and binned 
models (which can be modeled as countable-state Markov chains). It will also allow us to compare LOBs 
with different arrival price distributions. 

Lemma |3.1| asserts that the state of the limit order book is Lipschitz in the initial state with Lipschitz 
constant 1: in particular, small perturbations in the arrival and matching patterns will lead to small per¬ 
turbations in the state of the book. Lemma 13.21 asserts that actions that decrease cumulative bid and ask 
queues by either shifting orders or removing them in bid-ask pairs will only decrease future queue sizes. 

Lemma 3.1 (Adding one order). Consider a limit order hook C, and let L differ from L by the addition of 
one bid at time 0; let their arrival processes and price equivalence functions be the same. Then at all times 
C differs from C either by the addition of one bid or by the removal of one ask. 

Proof. Proof. The roles of “bid” and “ask” are symmetric here. The claim clearly holds until the additional 
bid is the highest bid that departs from the system; once it does, C differs from C by the addition of a single 
ask, and the result follows by induction. □ 

Define cumulative queue sizes Qb{p,t) = Bt{0,p], Qa{p,t) = At[p, 1). (Note that we count bids from the 
left and asks from the right.) When we want to highlight the dependence on only one of the variables, we 
will drop the other variable into a subscript. 

Lemma 3.2 (Decreasing queues). Consider a limit order hook C, and let C differ from C by modifying 
the initial state in such a way that Qb{-,0) < Qb(-,0), Qa(',0) < (5a(',0) (as functions of price), and also 
Qt,(l,0) — Qa(0,0) = (5&(1)0) ~ Qa(0, 0). In words, to get from L to C, at time 0 we remove some bid-ask 
pairs, and/or shift some bids to the right, and/or shift some asks to the left. Then at all future times t > 0, 
Qbi’C) ^ Qbi’A) o,nd Qa{‘,t) ^ Qai’A) US functions of price. 

Proof. Proof. We show Qb "£ Qb, the argument for asks being identical. The argument proceeds by induction 
on time, i.e. the number of arrived orders. Throughout the proof, we use notation ft- = limgN^t /(s) for the 
left limit of a cadlag function /. 
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Consider first the arrival of a bid at time t and price p. For it to upset the inequality, it must stay 
in C but depart immediately in C; additionally, we need Qb{q,t—) = for some q > p. Note 

that if the bid departs immediately in C, the leftmost ask at at- must be compatible with p, and in 
particular there are no bids right of p: Qb{p,t—) = Qbi^jt—)- This, together with Qb{q,t—) = ) 

and Qbi‘,t—) < ), implies that Qb{l,t—) = Qb{l,t—). Since bid-ask departures occur in pairs, this 

in turn implies Qa(0,t—) = Qa{0,t—). But it is easy to see that if Qa{-,t—) < Qa{-,t—) and they are equal 
at 0, then dt- (the leftmost jump of Qai‘,t—)) and at- (the leftmost jump of Qai‘,t—)) satisfy dt- < at-, 
and hence the arriving bid actually departs immediately in C as well. 

Next consider the arrival of an ask at time t and price p. For it to upset the inequality, it must cause the 
departure of the highest bid in C, but not in C, and we must have Qb{q,t—) = Qb(q,t—) for some q > Pt- 
with V{Pt-) > T’(p). Now, in C there are no bids at prices > V{jp), hence Qb(l, t—) = lim£_>o Qb{p — £, t—)- 
However, this contradicts the inequality Qb(‘, t—) < Qbi‘, f—), since lime_>o Qbip — e, t—) < lime_>o Qb{Pt- — 
e,t—) < Qb{l,t—) — 1. (Note that inequalities may not be equalities if there are multiple bids at the same 
price.) □ 


We can use this lemma to compare two limit order books C and C with identical initial states and order 
arrival processes, but different price equivalence functions. Suppose the price equivalence function V merges 
some of the values that were distinguished by V. Then any bid-ask pair that is compatible in C is also 
compat ible in £; and possibly additional bid-ask pairs are compatible in C as well. This lets us apply 
Lemma 


3.2 


to conclude that fewer orders will be present in £. 

We can give an upper bound on the queue sizes in C by using a binned LOB with one more bin and a 
shifted arrival process. If C has a bid arrival at price x in bin k = [x], we let C have a bid arrival at some 
price in bin fc — 1. The ask arrivals are identical in C and C. (If bins of C are numbered 1 through N, then 
bins of C are numbered 0 through N] bids arrive into bins 0 through — 1 in £, while asks arrive into bins 
1 through N.) Under this arrangement, any bid-ask pair that is compatible in C was compatible in C as 
well, so C offers an upper bound on the queue sizes of C. Consequently we can bound a continuous LOB 
C both from above and from below by two binned LOBs with slightly different arrival price distributions. 
(Assuming the continuous LOB has arrival distributions supported on [0,1], the binned LOB providing the 
upper bound will have bid arrival distribution supported on [—e, 1 — e] and ask arrival distribution supported 
on [0,1].) 

Finally, when bin sizes are small, the difference in the arrival price distributions will be small, and we’ll 
use Lemma 


3.1 


to bound the rate at which the states of C and C diverge. This will let us show that the 


behavior of the continuous LOB converges in a suitable sense. 


4. Proof of main results. 


We begin by stating a weaker form of Theorem 2.1 


Proposition 4.1 (Weak thresholds). There exist prices kj, and Ka with the following properties: 

(1) For any e > 0 there exists, almost surely, a (random) time Tq < oo such that Pt > Hb and at < Ka 
for all t > To- 

(2) For any e > 0, infinitely often there will be no bids with price exceeding Kb + e. Similarly, infinitely 
often there will be no asks with price below Ka — e. 

(3) The threshold values Kb and Ka satisfy Fb{Kb) = 1 — Fa{Ka). 

In addition, suppose that the bid and ask price densities (exist and) are bounded above by M. Then the 
following holds: 

(4) For any e > 0, with probability 1, there exists a sequence of times T„ —> oo such that at time Tn 
there are no bids with prices above 'P{Kb) + e, and the number of asks with prices below 'P(Ka) — e is 
bounded above by 2{M + l)eT„. 


Remark 3. Although the compatibility of bid-ask pairs is driven by the price equivalence function V{x), the 
statements about k are in terms of x itself. This is because whenever there are compatible bid-ask pairs, the 
bid with the highest value of x and the ask with the lowest value of x always depart the book. In particular, 
in a binned model, Kb and Ka will usually fall in the middle of some bin; in this “shoulder” bin, a nontrivial 





fraction of the arriving orders remain in the book forever. In Figure ki, is approximately half-way through 
the “shoulder” bin. (It is also possible for Kb to form the edge of a bin.) 

Remark 4. Note that we make no assertions here about the behavior of n~^Tn as n —>■ oo: for the purposes of 
this proposition, this sequence may well tend to zero. The proof of Theorem |2.I| will show that for any e > 0 
the sequence = n“^T„(e) is in fact eventually bounded away from zero, with the bound depending 

on e. 


Proof. Proof. The first two claims follow from Kolmogorov’s O-I law. Consider the events 

Sb{x) = {finitely many bids will depart from prices < x} 

£a(x) = (finitely many asks will depart from prices > x}. 

Lemma [TT] shows that these events are in the tail cr-algebra of the arrival process. Since the arrival process 
consists of a sequence of independent and identically distributed events, Kolmogorov’s 0-1 law ensures that 
for each x, £b{x) has probability 0 or I (and similarly for £a{x)). Now let 


(3a) 


Kb = supja; : F{£b{x)) = 1}, Ka = infja; : P(£a(a:)) = 1}. 


(If the set whose extremum is to be taken is empty, we let = 0 or Kq = !•) The first two asserted properties 
now follow upon noticing that £b(x) C £b{y) for x > y, and that whenever there is a bid departure at price 
X, there must be no bids at prices higher than x. (The situation is similar for asks.) 

We next show that Fb{Kb) -\- Fa{Ka) = 1. From the strong law of large numbers for the arrival process and 
the O-I law above, we know that Fb{Kb) is the smallest limiting proportion of arriving bids that stay in the 
system: 


(3b) 


Fb(Kb) = lim inf -^(bids in the LOB at time t). 

t—>-OQ t 


A similar equality clearly holds for asks with 1 — Fa{Ka)- Since bids and asks always depart in pairs, a further 
appeal to the strong law of large numbers for the arrival process shows that we must have Fb{Kb) = 1 — Fa^Ka)■ 
The existence of times T„ as in part (4) of the theorem follows by a similar argument from the functional 
law of large numbers for the arrival processes. With probability 1, picking a large enough time r„ when 
there are no bids at prices above V{Kb) + e ensures that there are at most {Fb{Ka) + (M -|- l)e)T„ asks in 
the system. Since asks to the right of Ka arrive at rate (I — Fa{Ka)) = Fb{Kb) and eventually never leave, for 
large enough r„ there will be at most 2{M + I)eT„ asks at prices below Ka — e. □ 


This result is weaker than the positive recurrence we wish to prove eventually: in particular, it does not 
show that the total number of both types of orders between Kb and Ka is ever zero. To obtain statements 
about positive recurrence, we shall need to use fluid limit techniques, and our overall approach will be similar 
to that in Chapter 4]. The final proof of stability will use multiplicative Foster’s criterion (state-dependent 
drift), see [TSl Theorem 13.0.1]. In order to get there, we need to show that whenever there are many bids 
or asks in (/Cb,Ko), their number tends to decrease at some positive, bounded below, rate over long periods 
of time. This is a standard line of argument in queueing theory; but the challenge of the model is that the 
evolution of the queues depends on which queues are positive, rather than which queues are large. In general 
Markov chains of this form are very difficult to analyze m show that in general the stability of such chains 
is undecidable), but the special structure of our chain makes it amenable to analysis. The outline of the 
proof is as follows. 

(1) We work with binned LOBs. We begin by showing that, after appropriate rescaling, both the queue 
sizes and the local time of the highest bid (lowest ask) in each bin converge to a set of Lipschitz 
trajectories, which we call fluid limits. We then proceed to develop properties of the fluid limits. 

(2) We next show that all fluid limits tend to zero for bins (strictly) between |k;,] -|- I and |Ka| — 1. We 
exploit the equations and inequalities satisfied by fluid limits to show the following: 

(a) There is an interval [xq, yo] on which, whenever the fluid limit of the number of orders is positive, 
it decreases (at a rate bounded below). Therefore, after some time Tq (which depends on the 
initial state), the fluid limit will be zero on [a;o,yo]- The values xq and yo may not be bin 
boundaries. 
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(b) Following Tq, we will be able to bound from below the rate of increase of the local time of the 
rightmost bid on [xi,xq) for some xi < xq, and of the leftmost ask on (j/o, yi] for some yi > j/o- 
Since whenever the highest bid is in [xi, ccq) it has a positive chance of departing (and similarly 
for asks in (yoi2/i])! conclude that whenever the number of orders in [xi,yi] is large, it will 
decrease (at a rate bounded below). We repeat the argument until « [K{,,Ka]. The Xi 

and yi may not be bin boundaries in this step. 

(3) We show that if on some interval, all fluid limits converge to 0 in finite time, then the binned LOB 
is recurrent on that interval. (This step is standard for fluid limit arguments.) Since the number of 
bids in a continuous limit order book can be bounded from above by binned ones, this will also show 
recurrence of the continuous LOB. 


4.1. ODE of the limiting distribution. Our first result shows that the ODE which should describe the 
unique limit, as t —> oo, of the empirical distribution of the highest bid does in fact describe some such limit. 
In the process, we also establish 0 < «{, < Ka < 1. 

Proposition 4.2 (Weak distribution of the highest bid). Suppose the arrival price distributions have densi¬ 
ties bounded above and below, and consider a sequence of binned LOBs with the number of bins, N, tending 
to infinity. 

For each N and e > 0, let Tn = Tn{N,e) oo be the sequence of times identified in part @ of 
Proposition \4 b\ Lot 'iT},{n,N,e) be the discrete normalized empirical density of the highest bid over the 
time interval [0,T„]; that is, 

time up to Tn that the highest bid is in |x] 


(4a) 


Tri,{n, N,e,x) = 


Tn ■ (length of |x] j 

oo, N^oo, t^ci'^b{‘n,N,e) ;= TTf,, and similarly for asks. 


(1) There exists a unique limit lim, 

(2) Denoting vut = i^hlfb ond zUa = r^a!fa, these satisfy the pair of integral equations 

/*1 /*^a 


Fa{x)Wb{x)= / Wa(,y)fa{y)dy, X £ (Kf,, Ka); 


'Wb{x)fb{x)dx = 1 , 


(4b) 


pX pKa 

{l-Fb{x))Wa{x)= Wb{y)fb{y)dy, X ^ {Kb,Ka)-, / ZUa{x)fa{x)dx = 1. 

Jo J Kh 


(5a) 


(3) Moreover, wherever Wb is differentiable, it satisfies the ODE 

1 - Fb{x ), 


fa{x) 


{Fa{x)Wb{x)y = Wb{x)fb{x) 


with initial conditions 

(hb) (F)j (x)t475(x)) I a;—1, (F)j (x)tn5 (x)) la,—0 

and the additional constraint Wb{x) —>-0 as x f Ka. The distribution of the leftmost ask satisfies a 
similar ODE. 

(4) The equation (§ has a unique solution; in particular, Kb and Ka are uniquely determined by it. 

Remark 5 (Normalization and initial conditions). From the integral equation 0 it follows that Wb will 
be a continuous function of price, whereas TTf, may not be. In particular, if we are interested in piecewise 
continuous functions ff and fa, then vub will satisfy the ODE (5a) on each of the segments where fb and fa 
are continuous, and can be patched together from the requirement that Wb(x) and {Fa{x)vJb{x)y are both 
continuous. 

The initial conditions (5b) apply for LOBs with finite initial states. Consider instead a LOB C with an 
infinite bid order at some price p > Kb. As long as the threshold kb of C is positive, we can do away with the 
infinite order at p by changing the price equivalence function so that ’P(O) = V{kb) — 'P{p): the evolution 
of C and this new LOB C will be the same at prices above p after the threshold time. In C, there is yet 
another threshold kb, and the initial conditions (5b) hold for all x S {kb,p], meaning tx7;,(x) = 1/Fa{x) on 
that interval. Correspondingly, in C, the distribution of the highest bid price has an atom at p of mass 
1/Fa{x)dx. For the lowest ask price, we will of course have P(at < p) = 0, but it may be the case that 
V0a(x) ■/> 0 as X y p: it may be discontinous at the location of the infinite bid. 


10 







Remark 6. The computations in our Steps and below are similar to computations appearing in m 
Section 1]; we present the full argument for completeness. 


Proof. Proof. The proof proceeds as follows: 

(1) Fix the number of bins N, and consider the collection of empirical densities 7rb(n, TV, e), (n, e). 

Along any sequence n —)■ oo, N ^ oo, and e —>■ 0 there is a convergent subsequence. 

(2) Any subsequential limit satisfies a certain pair of integral equations, hence some ODEs. 

(3) The ODEs will directly imply m, < na', in addition, 0 < Kf, and «;„ < 1. 

(4) The solution to these ODEs is unique, and in particular the limit does not depend on the order of 
n —>■ oo, N —>■ oo, and e —0. 

Step 1: The space of probability distributions with compact support is compact, so along any sequence 
of empirical distributions there will be convergent subsequences. Moreover, whenever the highest bid is in 
bin Icc], bid departures occur from the bin at rate > Fa(a;)7rf,(|a;]), whereas bid arrivals occur into that bin at 
rate at most fb(x). Consequently, under the assumption of bounded densities ft, fa, the highest bid density 
< fb{x)/Fa{x) is bounded uniformly in n, N, e; this guarantees the existence of limiting densities 
along subsequences. Einally, the lower bound on fa and fb guarantees that vua and zub are bounded, and 
hence also converge along subsequences. For steps 2 and 3, TTa,b and zua^b refer to any such subsequential 
limit, taken along a single subsequence for all four quantities. 

Step 2: The integral equations are expressing the idea that the rate of bid arrival should be equal to the 
rate of bid departure. Along a sequence of times where the queues are small (i.e. e « 0), this is very nearly 
true; it will be exactly true in the limit e —>■ 0. The bid arrival rate at x is fb{x)F{at > x) = fb{x) 'iTa{y)dy, 
and the bid departure rate at x is Trb{x)Fa{x), so setting the two equal gives the result; the ODE is obtained 
by differentiating twice. 

Of course, if we fix the number of bins N, the limit distribution will be described by a difference equation 
rather than an integral (or differential) equation. It is standard to see that the limit of solutions to the 
difference equations solves the differential (or integral) equation. 

Step 3: To see Kb < Ka, note that izb is bounded above by fb/Fa always, so if it integrates to 1 we 
must have Kb < Ka- To see kj , > 0 (and Ka < 1), we consider a binned LOB C with three bins, with bin 
partitions at x and x + 6 for some x G {Kb, Ka). By monotonicity, |Kt,] = 1 and |Ka] = 3. Eor S small enough, 
the number of orders in the middle bin will eventually be stochastically dominated by a geometric random 
variable. Indeed, whenever there are bids in bin 2, more bids arrive at rate Fb{x + (5) — Fb{x) and depart 
at the larger rate Fa{x + 5) (this is after asks from bin 3 stop departing). The situation is similar for asks. 
Consequently, in C we must have 7rft(2) > 0 and 7ra(2) > 0. 

If 7ft,(l) and 7fa(3) were such that (almost) all orders depart, then from TTb{l)Fa{x) = Fb{x) we find 


7ff,(l)Fa(a;) = Fb{x) 


7ff,(2) = 


Fa{x) - Fb{x) 


Fa{x) > Fb{x). 


Fa{x) 

Now let (5 be small enough that Fa{x) > Fb{x + 5), and solve for Ttb{2) from the alternative expression 
^b{2)Fa{x + (5) = {Fb{x + d) - Fb{x))^a{^)- This gives 

Fb{x) Fb{x + 5) - Fb{x)l - Fa{x + 5) 


7fb(l) + TTb{2) = 


< I. 


Fa{x) Fa{x + 5) I - F’f,(a; +15) 

The contradiction shows that in fact in this LOB we must have 7f;,(I)Fa(a:) < Fb{x) — rj for some rj > 0, 
which implies Fb{kb) > rj. By monotonicity, we obtain k;, > 0 as well (for N large enough that the above 
bin of width 6 is one of the original bins of the LOB). 

Step 4: The uniqueness of solution follows from the fact that we have a second-order ODE with two 
initial conditions (which, as we just showed, are finite). Note that an alternative argument for Kb > 0 would 
be to show that TTb{x) > 0 for some x > 0, since then the ODE forces iZbFa/fb decreasing, and TTb{x) ~ 1/x 
near 0, which is not integrable. However, it is not immediately obvious why in a binned LOB the highest bid 
couldn’t spend (almost) all of its time in the leftmost bin, hence we give the more involved argument above. 

It is at this moment possible that there are multiple solutions to the ODE with different values for Kb. 
Intuitively, this should not be the case, since any limiting Kb should give the (unique) threshold value of the 
continuous LOB. We will derive the uniqueness of the quadruple {Kb, Ka, izb, TZa) from Lemma 4.3 below, which 
shows that the solution of (5a) is monotonic in the initial conditions. This implies that the requirements 
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XTiT '^b{x)dx = 1 and Fb(K{,) = 1 — Fa{Ka) pin down Kb and Ka uniquely, since decreasing Kb increases the 
initial value of Wb and D 


The second result we require about the ODE is monotonicity in the initial conditions: 


Lemma 4.3 (ODE monotonicity). Let Wb and Wb he two solutions of the ODE (5aI with initial conditions 

Wb{xo) > Wb{xo), {wb)'{xo) > {wb)'{xo). 

Then for all x > Xq, zub(x) > Wb(x). 


Proof. Proof. We may reparametrize space monotonically so that Fa{x) = x. Then the ODE (5a) becomes 

f d \ ^ d 

{Fb{x) - 1) ( x—Wb{x) + Wb{x) ) + xfb{x)—Wb{x) = 0, 


dx 


which is a first-order ODE in -^Wb{x). Since solutions of first-order ODEs are increasing in their initial 
conditions, we obtain -^zub(x) > -^Wb{x), and the desired inequality follows trivially. □ 


Corollary 4.4. Suppose the initial conditions for zub come ( |5b[ ), and the initial conditions for Wb are 
Fa{xo)wbixo) = 1, (Fa(x)u7b(x))%^xg = 0 for some xq > Kb. Then zub{xo) < Wb{xo) and ^Wb{x)\x=xo < 
^Wb{x)\x=xo- Consequently, Wb{x) < Wb{x) for x > Xq. 

Proof. Proof. Reparametrize space as before, so that Fa{x) = x. Then 


{xWb{x))' = -7ro(x), Wb{x) 


J ^a{y)dy'^ . 


From this it is clear that Wbixo) < Wb(xo). Further, 


d I 1 f"" 

X — Wb(x) = -TTaix) - ZUb{x) = - h - / {zUa{y) - Wa{x))dy. 

dx X x Jq 

Now, in a LOB, (1 — Fb{x))vja(x) is increasing (cf. xvJb{x) which is decreasing), meaning Wa is increasing. 
Consequently, the integral above is nonpositive, and we see 


^^'^biaf)\x—XQ Si ^ — 0^0 ^^'^b(.a^')\x—XQ 


as required. 


□ 


4.2. Fluid limits. In this section we introduce the fluid-scaled processes associated with the limit order 
book, discuss their convergence to fluid limits, and determine properties of the limits. Throughout the 
section, we work with a binned limit order book. 

Let Bk{-) and Ak{-) be the arrival processes of bids and asks into bin k (indexed by time). The time 
structure of these processes is not important for our results, so we may assume that these are Poisson 
processes; by definition, they are independent. We will assume that the total arrival rate of bids is 1, and 
also of asks, so that if pf,(fc) (respectively Pa{k)) is the probability that an arriving bid (ask) falls into bin k, 
this is also the arrival rate of bids (asks) into that bin. Let Qb{k,t) (respectively Qa{k,t)) be the number 
of bids (asks) in bin k at time t. Let Tii{k,f) and Ta(k,t) be the amount of time up to time t when the 
rightmost bid, respectively leftmost ask, is in bin k: that is, 

s] = fc}ds, Ta{k,t) = f l{|as] = fcjds. 

Jo 

It is clear that the initial data Qbik, 0), Qa{k, 0) together with the arrival processes Bk{-), Ak{-) give sufficient 
information to determine the values of all of these processes at later times. We have the following expressions: 
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(6a) 

(6b) 

(6c) 

(6d) 

(6e) 

(6f) 


ll3tl = k Qb{k,t) >0, ^ Qb[k',t) = 0 

k’>k 

|at] = k Qa{k,t) >0, ^ Qa{k’,t) = 0 


k' <k 


Qb{k,t) = Qbik,0) + f l{|a(s)] > A:}dBfc(s) - ^ f l{|/3(s)] =/c}d^fc/(s) 
Qaik,t) = Qaik,0) + f l{|/3(s)] < fc}dAfe(s) - ^ f l{|a(s)] = fcjdSfc'(s) 

•^0 fe'>fc-^0 


T 0 {k,t)=T/ 3 {k,O)+ l{|^(s)] = fcjds 

Jo 

Ta{k,t) =Taik,0) + [ l{|a(s)] = fc}(is 
Jo 


We define the fluid-scaled processes by Xn{t) = n ^X{nt) for any process X. We now have the following 
result on convergence to fluid limits: 

Theorem 4.5 (Convergence to fluid limits). Consider a sequence of processes 

{Bn{k,-),A„ik, •),Qb,„(fc,-),Qa,n(fc>-),'r/3,n(fc, •),7’a.n(fc, •)) 

whose initial state (at time 0) is bounded: | |Q^ „(fc, 0), Qf, „(A:, 0)|| <1. As n ^ oo, any such sequence has 
a subsequence which converges, uniformly on compact sets of t, to a collection of Lipschitz functions 

{bk{-),ak{-),qb{k, ■),qa{k,-),Tfi{k,-),Ta{k, •))• 

(Different subsequences may converge to different 6 -tuples of Lipschitz functions.) We call the limiting 6 -tuple 
a fluid limit. 

Any fluid limit satisfies the following equations almost everywhere (i.e. everywhere where the derivatives 


are defined): 




(7a) 

b'kit) =Pbik), 4(t)=Pa(fc) 


(7b) 

^ iT-0{k,t)) 

= 0 i/ ^ qb{k',t) >0, — 

II 

O 


k'>k 



I^al 



(7c) 

^ T0{k,t)=t, ^ Ta{k,t)=t 






(7d) 

qb{k,t) > 0, 

qa{k,t) > 0 


(7e) 

d . 

0ifqb{k,t)=0, ^qaik,t) . 

= 0 ifqa{k,t) = 

(7f) 


k'>k k'<k 

(7g) 


51 - §:Ta{k,t) Pb{k'). 


k’ <k 

k'>k 

Proof. Proof. 

The expressions in ([^ together with the 

functional law 


k' <k 


processes lead to the u.o.c. convergence along subsequences to a fluid limit. The integral representation 
implies that limits must be Lipschitz functions. 

To see that any fluid limit must satisfy Q, we note that ( |7^ follows directly for the functional law of 
large numbers for the arrival processes. Identities (7b) follows from the corresponding statement for prelimit 
processes: if J 2 k'>k 9 b{k',s) > e > 0 on a time interval s S (t — e, t + e), then for all sufficiently large n, 
'l2k'>kQb,n{k',ns) > ne/2 > 0, so |/3(ns)] > k and Tp^n{k,ns) is not increasing. Identity (7c) holds because 
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the rightmost bid (leftmost ask) is eventually always in one of the bins in the prelimit processes, so this must 
be true in the limit. Identities (7d) follows for a similar reason: prelimit queues are nonnegative, hence the 
limit is nonnegative as well. 


Identity (Tel is a corollary of (7d): a process that is always nonnegative, differentiable at t, and equal to 


0 at t must have derivative 0 there. 

Finally, identities and ( [7g| ) follow from ([ 6 ^-([ 6 d|) for the prelimit queues. More precisely, the rate at 
which the bid queue size changes is as follows: if the lowest ask is higher than bin k, then bids arrive into 
the queue at rate Pb{k); and if the highest bid is in bin k, then all asks arriving at prices below k deplete 
the queue at k. Because the location of the highest bid or lowest ask does not show up in the fluid limit, we 
instead use the local times tp and t^. □ 

We introduce notation 7 r^(/c,t) = ^r/ 3 (fc,t), Tra{k,t) = -^Ta{k,t). 

4.3. Fluid limits drain. We will now show that in a LOB that starts with infinitely many bids in |«:t,] + 1 
and asks in [kq] — 1 , the fluid limit queue sizes drain, i.e. converge to 0 on the bins ranging from |Kf,] + 1 
to |Ka] — 1. We will assume that bin widths (and hence Pb{k), Pa{k)) are all small. This is the meat of the 
argument in the paper. 

Theorem 4.6 (Fluid limits drain). Consider a fluid limit corresponding to a binned LOB with N bins. 
Suppose the arrival process is symmetric (pb{k) = PaiN — k)), the probabilities Pa,b{k) are bounded below, 
and pb{k) is decreasing in k (pa{k) increasing in k). Suppose that initially there are infinitely many bids 
in bin + 1 and infinitely many asks in |Kal ~ 1 ; then the fluid limit of gueues can he described by 
eia,b{k,t) for |k;,] + 2 < fc < |«;a] — 2 , and the fluid limit of the local times can be described by TTa,b{k,t) for 
[Khl + 1 < fc < [Kal - 1- 

Let the initial state of the fluid limit satisfy || (^{,(0), 90(0)1] < 1. There exists e = e{N) —>■ 0 as iV —>■ oo, and 
a time T depending on {pa{k),pb{k), bin widths}, such that for all bins k satisfying |k{, + e] < fc < [ko — e], 
and all times t > T, 

9 b(fc, t) = 0, 9 a(fc, t) =0, Vt > T. 

] < fc < l^o — e] and for t > T, the derivatives 7 r^(fc,t) satisfy the 


Further, in the interval |k{, - 
second-order difference equation 


1 - Fb{k) 


(Fa{k) 


^Pa{k + 1 ) 

where the operator is given by Afc(/) = /(fc 
Fa{\Kb + e]) 

PbCK + el) 

A similar equation holds for asks. 


■nffk) = Tiffk + 1 ), 


\Pb{k) 

f 1) — /(fc). The initial conditions satisfy 


+ el) < 1 , ^ 0 . 


As N —^ oo, the solution of the difference equation converges to the 


solution of the ODE (5a I with initial conditions given by (5b). 


Note that Ka and Kb are the thresholds of an LOB with a finite starting state; the LOB with infinite 
bid and ask orders can be thought of as having different thresholds kb < Kb and ka > Ka. For large N, 
Lemma 3.1 implies kb ~ Kb and ka ~ Ka. 


Proof. Proof. The proof proceeds in stages. 

Stage 0. Let xg be given by Fa{xo) = Fb{Ka) - Fb{xo), and let yo be given by 1 - Fb{xo) = (1 - Fa{Kb)) - 
(1 - Fa{yo)). Equivalently, Fa{xo) + Fb(xo) = 2xo = Fb{Ka), so xq = iFb^Ka), and j/q = ^(1 + Eo(k&)). 

Claim 0.1: Kb < Xq < < Ka. 

Proof: Note that Fa{Kb) is a lower bound on the rate of bid departure from the Markov chain when there 
are any bids present, while Fb(Kb) — Fb{Ka) is an upper bound on the rate of bid arrival. Consequently, if 
Fa(Kb) > Fb{Kb) — Fb{Ka), then the number of bids on the entire interval {Kb,Ka) would be stochastically 
bounded, whereas it should scale as a random walk. A similar argument gives yo < Finally, \Fb{Ka) = 
^(1 — Fa{Kb)) < 5(1 + Fa{Kb)), since K{, > 0 by Proposition 


4.2 


□ 


Claim 0.2: There exists Tg =To{M) such that for all times t > Tq and all fluid models, ^[^j^^j^j^( 9 ;,(fc, t)+ 

qa{k,t)) = 0 . 
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Proof: Since these processes are absolutely continuous and nonnegative, it suffices to show that when¬ 
ever there are any fluid orders in the interval (and all the derivatives are defined), the fluid number of 
orders in the interval decreases at a rate bounded below. By and ( [7g| ), we see that for Q{t) = 
^Ia:ol+l<fc<lKal-l 


Q'it) < 


0 , 


Q{t) = 0 

Fb{xo) - Fa{xo) - e, Q{t) > 0. 


Consequently, after a finite amount of time there will be no fluid bids in bins > |a:o] -t-1. Similarly, after 
a finite amount of time Ta,o, there will be no fluid asks in bins < |?/ol ~ 1; we may take Tg = max(Tf,_ 0 i Fafi)- 

Claim 0.3: There exists eg > 0 such that for all times t > Tg and all fluid models, X]fe<|a;ol 
and X]fc>|yol '’^oi{k,f) > eg. (This result requires bins to be sufficiently small.) 

Proof: Note that equations and ( |7g[ ) hold at all times, even when there are no fluid orders in the bin; 

thus, for t > Tg and all k S [|a:g] -I- 1, [j/g] — 1] we have 


Pb{k) ^ TTaik' ,t) = TTi3{k,t) ^ Pa{k'), Pa{k) ^ 'Kp{k' ,t) = TTa{k,t) ^ Pb(fc'). 

k'>k k'<k k'<k k'>k 


Omitting the dependence on t for clarity, these equations, together with the observation that J2k'^o‘{k) = 
^^7r,g(fc) = 1, can be rearranged to give two decoupled second-order difference equations for 'Xa{k) and 
7 r^(A:). We abuse notation to write Fa(k) = J2k'<kPa{k') and similarly for Fb{k). 

(8a) Ak(^^^=^^yAk(^^Mk)^^=Mk + l), Nl + l<fc<M-l. 

(There is a corresponding equation for tTo, of course.) 

If we had two initial conditions for this second-order difference equation, we would be able to solve it. 
Unfortunately, in general we do not have such initial conditions, but we have bounds on them, namely 


These inequalities would hold with equality in a different limit order book /!g, in which we assign the same 
low price to all the bins up through [ccg] -I-1, and the same high price to all the bins from |yg|]_— 1 up. (We 

shows that 


4.4 


nonetheless keep track the bins containing the highest bid and lowest ask of C.) Corollary 
the solutions to (|^ on |a:g] -f 1 < fc < |?/g] — 1 are bounded from above by the solution for C. (The result is 
in continuous space, but the arguments work just as well for difference equations.) We refer to the solution 
for C as TT^ and tt-q,. 

Using the trivial upper bound on ^/ 3 {k) for k > |yg], we find 


(9) 




Iyol-1 I^al-i 

fe=Ia;ol+i fe=Iyol 


Pbjk) 

Faik)- 


Notice that np must equal (Fa(fc)) ^Pb{k) for !«;&] -I-1 < fc < |a:g], as bids will not be queueing in those bins. 
Consequently, for the first term in the right-hand side of ([^ we have the bound 


lyol-i 

^p{k) < 1 


fe=Ia:ol+l 


E 


fc=lK 6 l + l 


Pb{k) 

Fa{k) 


< 1 - 


[Kal-l 

E 

fe=Iyol 


Pbjk) 

Fa{k) 


— eo, 


as long as the bins are narrow enough. Indeed, notice that Xq — Kb > xq — Kb = Ka — Vo (from monotonicity 
of C vs. C and symmetry), the denominator is increasing in k, and the bid arrival density decreases with 
translation to the right. We require the bins to be narrow enough that the sums are all nonempty. 

Stage 1. We now let xi, 2/1 be defined by Fb(xg)-Ub(xi) = eoF^ixi) a.nd Fa{yi)-Fa{yo) = eo{l-Fb{yi)). 
Similarly to the argument for Stage 0, there exists a time Ti such that for all t > Ti there will be no fluid 
queues on [|xi] -I- I, I 2 / 1 ] — 1]. Indeed, if there are fluid bids in the interval [|xi] + 1, |xg]], then whenever 
the highest bid is below |xg] it is in fact in this interval; the defining inequality then means that the fluid 
amount of bids in this interval decreases, and similarly for asks. 
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Next, we use the difference equation description on [|a;i] + 1, [yol — 1] to show that after Ti, the highest 
bid spends at least ei > 0 of its time below xi. This will require comparison against a different restricted 
LOB £i, where we merge all prices up to |a;i] + 1 and from [yi] — 1. 

Subsequent stages. We can now construct a nested sequence of intervals ... < X 2 < xi < xq < yo < 
yi < 2/2 < ■ • ■) where the inequalities are strict provided bins are narrow enough. It remains to show 
that limfc_,.oo,Ar->.oo 2 ;/c = Kb and limfc_,.oo_Ar->.oo 2/fc = (Note that N —?> oo, i.e. thinner bins, is certainly 
necessary for this to hold!) 

This result follows from the fact that can be taken to be bounded below: 


( 10 ) 


[a;;! 

E 


k=lKb\ + l 


Pbjk) 

Fa{k) 


^ Pbjk) ^ / 1 



(n(M)-F,([«J + l)). 


As long as Xi is bounded away from Kb (and bin widths are small enough), this will be bounded below, and 
therefore Xi — Xi+i and y^+i — yi will be bounded below. 

Convergence to ODE. The convergence of bounded solutions to difference equations to solutions of an 
ODE is standard. The argument above gives an inequality for the initial conditions, but note that as we 
approach Kb the initial conditions become exact. Indeed, 


Fa{Kb + €)wb{Kb + e) = / Wa{x)fa{x)dx 
J Kb + f. 

since the lowest ask will never be below Kb- Also, 

{Faix)Wb{x)y\a:=K.b+e = Z^a(K& + c) = -(1 - Fb{Kb + e))~^ / Wb(x)fb(x)dx 0, 

Jo 

since the highest bid density is bounded. 


1 , 


Putting this result together with Proposition |4.2| shows that, for symmetric distributions pb, Pa with pb 
decreasing, the fluid limits Tr/ 3 (k,t)/pb{k), TTa{k,t)/pa{k) will approach, as t —>■ oo and N —)■ oo, the solution 
of the ODE 0, uniformly on compact subsets of («{,, Ko)- 


Remark 7. The argument leading to the inequality (10) implies that the joint density of the highest bid and 
lowest ask must be bounded away from zero on at least a fraction of the boundary of the support, i.e. the 
probability of the event “there are no asks below Ka and the highest bid is at Kb + x” should be 0{x) but not 
o{x). In fact, the simulated joint density in [2tij is bounded away from 0 everywhere except the very corner 
(highest bid at Kb and lowest ask at Ka)- 


It remains to show that stability of fluid limits implies positive recurrence of the Markov chain. 


Lemma 4.7 (Fluid stability and positive recurrence). Consider a LOB satisfying the assumptions of Theo- 
rem \4-(^ Suppose that on some interval of bins ko < k < ki, all fluid limits with initial state bounded above 
by I satisfy the following: there exists a time T (depending on {pa{k),pb{k), bin widths}), such that for all 
times t>T, 

qb{k, f) = 0, qa{k, t) = 0, kg < k < ki, t > T. 

Consider a limit order book C started with infinitely many bids in bin kg — 1 and infinitely many asks in 
bin ki + 1; its state is described by the Markov chain of queue sizes in bins kg < k < ki- The Markov chain 
associated with L is positive recurrent. 


Proof. Proof. To go between fluid stability and positive recurrence, we use multiplicative Foster’s criterion 
[T51 Theorem 13.0.1]. Let 

Q{t) = \\{Qb{k,t),Qa{k,t))ko<k<kb\\ , 


and let C be sufficiently large. Let <5(0) — q > C, and consider the fluid scaling Q^j,{k,t) = q~^Qa,b{k,qt). 

By Theorem 4.5 if C and hence q is large enough, there exists a fluid limit {qa{k, t), qb(k, t), Taik, t), Tpfk, t))kg<k<ki 
satisfying \\qj(k,t),qb{k,t)\\ = 1 , such that 


F{\\Q^{k,t) - qa{k,t),Qb{k,t) - qb{k,t)\\ > e) < e for all t e [0,T]. 
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In particular, V{\\Qa{k,qT),Qa{k,qT)\\ > eg) < e. Note further that \\Qa{k,qT),Qb{k,qT)\\ < A{qT) + 
B{qT) is bounded by the arrival process, and hence has all moments. Thus, we conclude 

^.IWQaik, qT),Qbik, qT)\\] < e(l + 2T)q. 

Choosing e < (1 + 2T)~^ completes the proof. □ 

4.4. General order price distributions. It remains to remove the extra conditions (symmetric and de¬ 
creasing) on the order price distributions, and finish the argument for continuous limit order books. This 
requires two observations: 

(1) Recall that a continuous LOB could be bounded by two discrete LOBs with different arrival price 
distributions (in one of them, we shift all arriving bids one bin to the left). This shifted arrival 
distribution no longer satisfies the absolute continuity conditions, but nevertheless, Lemma [XT] shows 
that all of the above fluid-scaled arguments work for it as bin size shrinks to 0. Specifically, we model 
the bid arrivals as shifting the rightmost bin of bids all the way to the left, and then the difference 
between the two books is at most two bins’ worth of arrivals over the fluid time interval [0,T], 
which will be small provided bins are narrow. This allows us to conclude the positive recurrence 
of a continuous LOB with infinitely many bids at price V{Kb) + e and infinitely many asks at price 
'P{Ka) — e, provided the densities fa-, fb are bounded above and below, symmetric, and fb is decreasing. 


( 11 ) 


(2) By Lemma 3.2 replacing the bid arrival price distribution by another distribution with stochasti¬ 


cally higher prices, and/or replacing the ask arrival price distribution by another distribution with 
stochastically lower prices, results in fewer orders in a book. In particular, if we have shown the 
positive recurrence of an LOB with an infinite supply of bids at price p and asks at price q with a 
particular arrival distribution, the LOB will remain positive recurrent when we switch to an arrival 
price distribution with bids further right, and asks further left. Notice that as long as there are 
bids in the interval (p, q), they evolve on that interval identically whether or not there is an infinite 
supply of bids at p; and similarly for asks. This can be used to show that fluid limits drain in the 
new LOB on the interval {p,q). 

In the new LOB with the shifted price distribution, {p, q) may not be close to {kb, ka), so we will 
be wanting to extend the interval, as in Claim 0.3 of Theorem |4.6| The argument there does not 
use the full extent of the symmetry and monotonicity conditions; they are only used to prove the 
inequality 

M /,X [Kal-l ,js 

Pb{k) ^ Pb{k) , 


E 


Fa{k) 


> 


^ Fa{k) 

fc=IM+i ' ^ ^ ^ 

for some e > 0. For this inequality to hold, it is entirely sufficient to have 

r 


-dx > 


Mx) 


dx - 


Jkt Fa{x) Jq Fa{x) 
with no constraints on what happens between p and q. 

Consequently, for a general pair of densities {fb, fa) bounded below and above, we begin by 
finding fb,o,fa,o with Fb.g > Fb, Fap ^ Fa which are symmetric and for which fb.g is decreasing. 
(For example, we may take fbp = fa,o = miii(/a, fb) on most of the interval, with fbp taking a large 
value near 0, and fa,o taking a large value near I.) We use Theorem 4.6 to show that fluid limits 
drain for fb,o,fa,o (and hence, by Lemma [3.2[ also for {fb,fa)) on an interval {nbp, i^afl)- We then 
modify fb^i on (0, Ko,o) and fa,i on {Kbp, 1) to find the next pair of bounded densities {fbp, fa,i) for 
which Fbft > Fb^i > Fb, Fa < Fap < Fap, and ( |Tl] ) holds. We already know from monotonicity that 
fluid limits will drain for these distributions on (k 6 , 0 ) «^a,o)) and we use the inequality for p < Kbp 
and q > Kap to extend fluid stability to the bigger interval {Kb,i, Ka,i)- We repeat the process until 
the interval {Kb,n, Ha.n) approaches the entire interval {kb,ka) for {fb,fa)- 

To see that it will indeed approach the entire interval, notice that all that really matters for the 
thresholds of a LOB is Fap{x),Kb < x < Ka', it is immaterial what fb and fa do outside of those 
intervals, so long as they integrate to the correct amounts. Consequently, if Kb^n > kb + e, it must 
be that Fb^n < Fb or Fa^n > Fa somewhere on [Kb,n, Ka,n], which means that the process won’t get 
“stuck” until Kb^n \ kb and Ka,n ka- 
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5. Discussion. 


In this section we discuss several applications of our methods and results. We begin with a discussion of 
market orders and then consider various simple trading strategies. 

5.1. Market orders. The orders we have considered so far, each with a price attached, are called limit 
orders. Suppose that, in addition to limit orders, there are also market orders which request to be fulfilled 
immediately at the best available price. Suppose that limit order bids and asks arrive as independent Poisson 
processes of rates Ub, Va respectively; and that the prices associated with limit order bids, respectively asks, 
are independent identically distributed random variables with density fb{x), respectively fa{x). Without 
loss of generality we may assume that a; G (0,1). In addition suppose that there are independent Poisson 
arrival streams of market order bids and asks of rates fib, k-a respectively. Then these correspond to extreme 
limit orders: we simply associate a price 1 or 0 with a market bid or market ask respectively. 

Note that, in addition to market orders, we have also allowed an asymmetry in arrival rates between bid 
and ask orders. The intuition behind equations 0 leads to the generalization 


(12a) 


(12b) 


Vbfb{x) / T^a{y)dy = TTbix) { fla + l^a / fa{y)dy 


'^afa{x) / TTb{y)dy = TTaix) f Vb / fb{y)dy + fib 

Kb \ J X 


although now the existence of a solution to these equations satisfying the required boundary conditions is 
not assured, and the deduction of the recurrence properties necessary for an interpretation of TTb(x),T^a{x) as 
limiting densities may fail. To illustrate some of the possibilities we shall look in detail at a simple example. 
Suppose fa[x) = fbix) = l,x G (0,1), j/q = Ufc = 1 — A and fia = kb = A. Thus a proportion A of all 


orders are market orders. Use the notation 7 rb(-\; x), 7ra(-\; x) for the solution to equations (121 satisfying 
the required boundary conditions in this example. Then provided A < w « 0.278, the unique solution of 
we'^ = e~^, this solution has 7 ra(A;x) = 7rf,(A; 1 — a:) and 


(13) 


Trb{X]x) = 


1-A 
1 + A 




1 + A 
1-A 


A 


X — 


1 - A 


X G (n(A), 1 — «:(A)) 


where TTb{-) is the earlier solution 0 and 


c(A) = 


1 + A 
1-A 


w 

1 + ic 


A 

1-A’ 


Indeed, provided X < w the model is simply a rescaled version of the earlier model with distribution (13) 


having a support increased from (k, 1 — k) to the wider interval (k(A), 1 — k{X)). The inclusion of market 
orders in the model causes the price distributions to have atoms and not to be absolutely continuous with 
respect to each other; but nevertheless the analysis of earlier sections continues to apply since the market 
orders arrive outside of the range (k(A), 1 — n(A)). 

Next we explore this example as A f and the support becomes the entire interval (0,1). In our model a 
market order bid, respectively ask, which arrives when there are no ask, respectively bid, limit orders in the 
order book waits until it can be matched. When X < w there is a finite (random) time after which the order 
book always contains limit orders of both types and no market orders of either type and hence the analysis 
of previous sections applies. But if A > w then infinitely often there will be no asks in the order book and 
infinitely often there will be no bids in the order book, with probability 1. Now the difference between the 
number of bid and ask orders in the limit book is a simple symmetric random walk and hence null recurrent. 
There will infinitely often be periods when the state of the order book contains limit orders of both types 
and no market orders of either type, but such states cannot be positive recurrent. 

In the model described above an arriving market order which cannot be matched immediately must wait 
until it can be matched. If instead such orders are lost then we obtain a model which can be analyzed by 


the methods in Section 5.2.1 namely, we start the LOB with an infinite bid order at 0 and an infinite ask 
order at 1. 
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5.1.1. Differing arrival rates. Our analysis in earlier sections assumed bids and asks arrived at the same 
rate. This was without loss of essential generality, as it is convenient to illustrate now with a discussion of 


equations (12 1 when fa{x) = fbix) = l,x G (0,1), Va, > 0 and Ha = ti-b — 0. The solution to equations (12) 
satisfying the required boundary conditions is then 

1, Z' 1 — x\ 1 . /I — Kq 


TTbix) = Ko - + log 


-log 


X G {Kb,Ka) 


VaKa = t'h(l - Kb) 


(14) 

where 

(15) 

and Kb is the unique solution to 

' Kb{Va/vb -1 + Kb) J V ' Vb J ^ - Kb 
Although Va and Vb may differ, provided they are both positive the thresholds Ka and Kb are both inside the 


(1 - Kb)'^ 


= I 1 + — 

Vb 


1 


interval ( 0 , 1 ) and ensure the necessary balance (151 between bids and asks that are matched. 

If there are market orders, that is if g^-a^iJ-b > 0, then this results in a rescaling of the distribution (14) 
provided the support of the rescaled distribution remains contained within the interval ( 0 , 1 ). 


5.1.2. Market impact. As a further illustration, consider the case where fa{x) = fb{x) = l,x G (0,1), 
Va = Vb = 1 and pLa, z^b > 0. Use the notation 7 rh(/Xa,/rt,; cc), 7 ra(/ra,/r?,; x) for the solution to equations (12) 
satisfying the required boundary conditions in this case. Then provided p-a/ {1+pib), tib/i^+P-a) < w(s 
this solution has TraiPa,Pb', x) = 7rb(pb, Pa', 1 — x) and 


0.278) 


(16) TTb{pa,Pb',x) = 


TTfa ((1 + Pb)x + Pa{l - X)) 


X G 


}{1 + pb) - Pa 

W + 1 


, 1 - 


}{1 + Pa) - Pb 
W + 1 


^ + Pa + Pb 
where Trb{.) is the earlier solution ([^. 

An important assumption for our mathematical development has been that all orders are for a single 
unit, and an outstanding question concerns the extent to which the model can be generalized. In practice, 
a long-term investor who wishes to buy or sell a large number of units may choose to spread the order in 
line with volume in the market, so as not to unduly move the price against her [5]. We are able to analyze 
the market impact of a particularly simple approach, when the investor leaks the order into the market 
according to an independent Poisson process over a relatively long period, where the market relaxes to the 
new equilibrium dynamics over that period. Thus the impact of a large market order to buy will be to 
increase the parameter pi, to say pb -I- e. As e increases the time taken to complete the order decreases, but 
the impact on the distribution (16) increases, leading to an overall less advantageous trading price. Similarly 
if a large limit order is leaked into the market as an independent Poisson process, this can also modeled by 
a perturbation of equations (12). 

In markets with a relatively small set of participants with large orders there may be advantages in market 
designs where large transactions may be quickly arranged at fixed prices; [7] discuss trading protocols that 
complement limit order books for large strategic investors. 


5.1.3. One-sided markets. Toke |25] has considered a special case where analytic expressions for various 
quantities such as the expected number of bids in a given interval are readily available, as we now describe. 

Suppose that fb{x) = l,a: G (0,1), Va = Pb = 0 and pa > Vb > 0. Thus all bids are limit orders and all 
asks are market orders, a one-sided market. Then r^bix) = Vb/pa,x G {Kb, 1) where Kb = 1 — Pa/vb- And, 
further, for x > Kb the number of bids present in the interval {x, 1), that is B{x, 1), is a birth and death 
process whose stationary distribution is geometric with mean r'b(l — x)/{pa — Vb{l — x)). Thus, for example, 
E[B{x,y)] can be readily calculated. 

Various generalizations are also tractable, provided the market remains one-sided [25]. For example, 
suppose each bid entering the LOB is cancelled after an independent exponentially distributed time with 
parameter 9 unless it has been previously matched. Then the number of bids present in the interval {x, 1) 
is again a birth and death process. Now the entire LOB is a positive recurrent Markov process, and it 
is straightforward to verify that, as 0 ), 0 , bids to the left of Kb are seldom matched and the stationary 
distribution of the rightmost bid approaches Trb{x) = VbjPa,x G {Kb, 1), as we would expect. 
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5.2. Trading strategies. Next we consider a few simple strategies that can be analyzed using our model. 
For simplicity, we present the results for the case when the bid and ask price distributions are equal and 
uniform on (0,1), but the analysis easily extends to other arrival distributions. The limiting densities of the 
rightmost bid and leftmost ask for this model were determined in Corollary |2.3| 


5.2.1. Market making. We begin by considering a single market maker who places an infinite number of bid, 
respectively ask, orders at p, respectively q=l — p, where Kb < p < q < Ka- Thus whenever q is the lowest 
ask price, the trader obtains all bids that arrive at prices above q, and whenever p is the highest bid price, 
she obtains all asks that arrive at prices below p, making a profit oi q — p per bid-ask pair so acquired. Call 
the orders placed by the trader artificial, to distinguish them from the natural orders. The rate at which 
the trader is able to match her orders is proportional to p times the probability that the rightmost bid is 
exactly p. 

Placing an infinite supply of bids at a level below k has no asymptotic effect on the evolution of the LOB. 
For p> K~ 0.218 no ask is accepted at a price less than p, and there will be a positive probability that the 
rightmost bid is exactly p (i.e., there are no bids at prices above p). To find this probability, we consider 
the following alternative model C: there is an infinite supply of bids placed at 0, but the price equivalence 
function V is constant on [0,p]. (Otherwise, the initial state, arrival processes, and price equivalence functions 
coincide in C and C.) In C, the bids and asks above p will interact just as in C, so the probability that the 
rightmost bid is the infinite order in C is equal to the probability that the rightmost bid is at or below p in 
C. Note that only finitely many of the bids placed at 0 will ever be fulfilled in C. By Lemma 3.1 pathwise. 


at all times the difference between the bid/ask queue sizes in C and a limit order book without the infinite 
supply of bids at 0 will be bounded by the overall number of bids departing from that infinite supply. Hence 
the infinite bid at 0 is irrelevant for the analysis of the steady-state distribution of the highest bid, since in 
the limit t —>■ oo the difference will disappear. 

In L, asks at prices below p cannot stay in the book, i.e. Wa{x) = 0 for x < p. By Remark]^ the density 
of the highest bid Wb{x) is equal to \/x on [Kb,p), and to C + log on [p, q] (the latter is obtained as 
in Corollary 2.3). Recall that Wb is continuous, which allows us to determine C and Kb (since zub integrates 
to 1). This allows us to find Kb as 

/I ^ C 
p l-p 

= - - 

e \ p 

and to deduce that the rightmost natural bid has density 


ZUbix) = 


1 


^ I 1 , 1 — a; 

C - + log- 

.X X 


p f l-p 


p 

p < X < q; 


< X < p; 


where C = (1 -l-plog((I — p)/p))~^. The probability the rightmost natural bid isp or less is thus I—C'log((l — 
p)/p), and this is therefore the probability that the rightmost bid is exactly p in the model with infinitely 
many artificial bids at p and infinitely many artificial asks ai q = 1 — p, where k < p < 1/2 < q. 

To maximize the profit rate we need to solve the optimization problem 


maximize 


l-p 


P 


where C = [1 + p log 


{l-2p)p (^1-Clog 
subject to p S [k, 1 /2]. 

The maximum is attained at p « 0.377, and gives a profit rate of « 0.054. 


l-p 


5.2.2. Sniping. We next consider a trader with a sniping strategy: the trader immediately buys every bid 
that joins the LOB at price above q, and every ask that joins the LOB at price below p (with q = l-p still). 
Now the trader has lower priority than the orders already in the queue, but she obtains a better price for 
the orders that she does manage to buy. 

The effect on the LOB of the sniping strategy is to ensure there are no queued bids above q and no queued 
asks below p; for p < q, the set of bids and asks on (p, q) has the same distribution in the sniping and the 
market making model, and therefore the probability that p is the highest bid is the same as in the market 
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Sniping vs. market maker 



P 


Figure 2. Profit from sniping and market making strategies. Solid line is the sniping 
strategy, dashed line is the market making strategy. (Sniping with p < 1/2 is shown for 
completeness; as argued in the text, it does not maximize the profit.) 


making model as well. (The profit rates for the trader are different.) But it also makes sense to consider 
the sniping strategy with p > q, when it ensures that there are no queued orders of any kind in the interval 
{q,p): they are all sniped up by the trader. (An ask arriving at price a G [q,p) cannot be matched with a 
queued bid, because there are no queued bids above q.) The trader makes a net profit of zero on the orders 
in {q,p)] the point of sniping them is to increase the probability of being able to buy a bid at a high price. 

Summarizing, if p > g then the LOB has no queued orders between p and q. Since all the bids are at 
prices below g, and the ask density there is zero, we seefrom Propositionthat the density of the rightmost 
bid is Wb{x) = \/x on [Kb,g); since wi, integrates to 1, we find = g/e. Notice that the distribution of the 
rightmost bid stochastically decreases as g decreases, hence the probability of acquiring an ask at low price 
a < 1/2 increases as g decreases. This shows that the profit rate from sniping bids above g and asks below 
p for p > 1/2 is strictly higher than the profit rate from sniping bids above p and asks below g. Thus, it 
suffices to consider the case of p > 1/2 > g. We thus solve 

pl — p 

I y rr 


maximize 


subject to 


-'Kb 

pe [1/2,1]. 


(1 — 2 x) log —dx 
Kb 


where 


Kb = 


1 -p 


The maximum is attained at 1 — p = g = e/(e^ + 1) ~ 0.324 and gives a profit rate of « 0.060. 

Figure [^presents a comparison between the profit rates from the market making and sniping strategies, 
as a function of p (which, recall, is the price below which the trader would like all asks) - for completeness, 
p < 1/2 is included for the sniping strategy as well. 


5.2.3. A mixed strategy. It is possible to consider a mixture of the above strategies: the trader places an 
infinite supply of bids at P (thus acquiring all asks that arrive below P whenever P is the highest bid price), 
but in addition attempts to snipe up all the additional asks that land at prices x < p. We assume the trader 
gets the best of the two possible prices when both p and P are larger than the price of the arriving ask. 
There are several possible cases corresponding to the relative arrangement of p, P, and 1/2: 

(1) If p < P (this means that there are no additional asks to snipe up), this degenerates to the mar¬ 
ket maker strategy, with a profit of (I — 2P) per bid-ask pair bought, with pairs bought at rate 
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Figure 3. Profit rate from the mixed strategy as a function of sniping threshold p and 
infinite bid order location P. 


P\og{P/ Kb). (The probability of the highest natural bid being below P is log(P//Cb); when it is 
there, asks arrive at prices below P at rate P.) Clearly, one wants P < 1/2 in this case, otherwise 
the profit is negative, so we can write this case as p < P < 1/2. 

(2) If P < p < 1/2, then one gets additional asks at price x at rate log{x/Kb), for a profit of (1 — 2x), 
for all X from P to p. 

(3) If P < 1/2 < p, there are two further cases: we may have P<1—porP>l — p. 

(a) If P < 1 — p < 1/2 < p < 1 — P, then the trader snipes all orders between 1 — p and p for a 
net profit of 0. Profit (1 — 2P) from a bid-ask pair matching the infinite orders is generated 
at rate Plog((l — p)/Kb), and profit 1 — 2x, P < x < 1 — p, from sniping is generated at rate 
1 + \og{x/Kb). By Remark]^ the highest bid density is 1/a: on {nt, 1 — p], so Kb = (1 — p)le. 

(b) Ifl—p<P<l/2<l — P<p, then P is always the best bid, which means that the trader gets 
all the asks that arrive below P, generating profit at rate (1 — 2P)P. Orders arriving between 
P and 1 — P cancel each other, and all the asks arriving between 1 — P and p are bought up 
for a loss (negative profit) of (1 — 2a:). 

(4) Finally, the case P > 1/2 is silly, because every bid-ask pair bought will be bought at a loss. 

Figurej^shows the profit for the two-parameter space. The largest profit is obtained when P = 1—p = 1/4, 
and the profit is then acquired at rate 1/8 = 0.125. This corresponds to the trader placing an infinite bid 
order at 1/4 (thus buying all asks that arrive with price below 1/4 for 1/4), an infinite ask order at 3/4, and 
sniping up all orders that join the LOB at prices between 1/4 and 3/4. 


5.3. Competition between traders. Finally we comment on the situation that arises when multiple 


traders compete using the simple strategies described in Section 5.2 


Consider first the case of two competing traders, the first of whom has the ability to employ a sniping 
strategy of the form described in Section |5.2.2[ and the second of whom cannot act quickly enough to snipe 
but does have the capacity to employ a market making strategy of the form described in Section |5.2.1| 
Suppose then the market maker places an infinite number of bid, respectively ask, orders at P, respectively 
1 — P, where P < 1/2. And suppose the sniper immediately buys every bid that joins the LOB at price 
above q, and every ask that joins the LOB at price below 1 — q, where P < q < 1/2. 
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For given P and q the behavior of the LOB is as analyzed in Section |5.2.3| but incentives for the two 
traders are different. Given P and q the profit rate for the sniper is /p(l — 2a:) ln{ex/q)dx and for the market 

maker (1 — 2P)P 1/x dx = (1 — 2P)P \og{eP/q) provided P > = q/e. li P < q/e the market maker’s 

orders are outside of the recurrent range («(,, 1 — Kb) of the LOB and so are not matched. It is natural to 
suppose the sniper follows the market maker: that is the sniper observes the choice P of the market maker 
and chooses q accordingly. The maximizing choice for the sniper is then q = ^P{1 — P). Given this, the 
optimum choice for the market maker, that maximizes his profit rate, is P « 0.340. At this equilibrium, the 
profit rate of the market maker is 0.073 and of the sniper 0.020. 

Consider next the case of two or more traders using either the market making strategies of Section [5.2.11 
or the mixed strategies of Section |5.2.3| Each trader will have an incentive to improve slightly the prices 
at which she places infinite orders of bids and asks, thus gaining all the profit from those orders for herself 
alone. The Nash equilibrium has the traders compete away the bid-ask spread and with it all their profits. 
The model becomes an example of the Bertrand model of price competition and, as there, the conclusion is 
softened with more realistic assumptions on, for example, capacity constraints or cost asymmetries. 

Next consider competition between two or more traders using the sniping strategies of Section [5.2.2[ for 
example between traders who attempt to snipe a limit order as it arrives with an exactly matching limit 
order. If multiple traders attempt to simultaneously snipe the arriving order, than one of them will succeed 
and the others will cancel their own orders immediately as they detect that their orders have not been 
successful. There is clearly an advantage for a trader who can snipe an arriving order more quickly than 
the other traders, and indeed such a trader can enforce the optimum sniping strategy of Section |5.2.2| and 
exclude slower traders from the market. It has been argued that competition on speed is wasteful (see 0 ), 
and there are proposals to encourage traders to compete on price, rather than speed, as for example in the 
proposal of [4] where a market continuous in time is replaced with frequent batch auctions, held perhaps 
several times a second. We shall explore the consequences of competition on price between sniping traders 
who can all react at the same speed to a new order entering the LOB. 

In such a competitive environment traders will have an incentive to increase the price q above which they 
snipe bids, and decrease the price p below which they snipe asks, towards 1/2: they will refrain from sniping 
orders on which they would expect to make a loss. At the Nash equilibrium, each trader will snipe at all asks 
with prices below 1/2 and at all bids with prices above 1/2 (getting the order with the same probability as 
each of the other traders). The rightmost bid will then have density 1/x on (l/(2e), 1/2) by Remark]^ This 
results in a combined profit rate l/(2e) — (1 -|- e^)/(8e^) « 0.042. Thus price competition between sniping 
traders has decreased their combined profit rate only slightly, from 0.060 to 0.042. Alternatively, one can 
view this reduction as the effect of a batch rather than a continuous market. 

Next we comment on the impact of traders on the bid-ask spread. The mean of the distribution ([^ can be 
calculated and is simply (1 — k)/2. Thus without traders the mean spread between the highest bid and the 
lowest ask in the LOB is k « 0.218, while the maximum spread is 1 — 2k « 0.564. At the Nash equilibrium 
between sniping traders both are increased, the mean spread to 1/e ~ 0.368 and the maximum spread to 
1 — 1/e « 0.632. For comparison, with a single sniping trader both are further increased, the mean spread 
to 1 — 2(e — l)/(e^ -I- 1) « 0.590 and the maximum spread to 1 — 2/(e^ -I- 1) « 0.762; and at the equilibrium 
between a market making trader and a sniping trader the mean and maximum spread are as low as 0.228 
and 0.320 respectively. These calculations are of course for a specific example, but they do illustrate the 
tractability of the model and its insights. 

As a final remark we comment on the inventory of traders under the Nash equilibrium between sniping 
traders described above. Observe that the LOB below 1/2 evolves independently of the LOB above 1/2, and 
both processes are positive recurrent inside their corresponding thresholds. Consider the net position of the 
traders collectively, that is all the bids they have matched minus all the asks they have matched, observed at 
those times when the LOB is empty. This evolves as a symmetric random walk, and is null recurrent. Slight 
variations of the traders’ strategies would moderate this conclusion: for example, a trader might refrain from 
sniping bids close enough to 1/2 when his net position is large. And of course such variations will be essential 
over longer time-scales than those considered in this paper where the arrival price distributions may vary. 
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